Quantized Neural Networks (QNNs), which use low-bitwidth numbers for representing parameters and performing computations, have been proposed to reduce computational complexity, storage size, and memory usage. In QNNs, parameters and activations are uniformly quantized, so that multiplications and additions can be accelerated by bitwise operations. However, the distributions of parameters in Neural Networks are often imbalanced, and uniform quantization determined from extremal values may therefore underutilize the available bitwidth. In this paper, we propose a novel quantization method that ensures a balanced distribution of quantized values. Our method first recursively partitions the parameters by percentiles into balanced bins, and then applies uniform quantization. We also introduce computationally cheaper approximations of percentiles to reduce the computation overhead this introduces. Overall, our method improves the prediction accuracy of QNNs without introducing extra computation during inference, has negligible impact on training speed, and is applicable to both Convolutional Neural Networks and Recurrent Neural Networks. Experiments on standard datasets including ImageNet and Penn Treebank confirm the effectiveness of our method. On ImageNet, the top-5 error rate of our 4-bit quantized GoogLeNet model is 12.7\%, which is superior to the state of the art for QNNs.
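A minimal sketch of the balanced-quantization idea described above, assuming NumPy and illustrative function names of our own choosing (this is an assumed reading of the method, not the authors' implementation): weights are partitioned by percentiles, obtained here via repeated median splits, into 2^k equally populated bins, and each bin is then mapped to one of 2^k uniformly spaced quantized values.

```python
import numpy as np

def balanced_quantize(weights, bits=2):
    """Illustrative sketch: partition weights by percentiles into 2**bits
    equally populated bins (recursive median splits are equivalent to cutting
    at the i/2**bits percentiles), then map each bin to a uniformly spaced
    quantization level in [-1, 1]. Not the authors' code."""
    flat = weights.ravel()
    n_bins = 2 ** bits
    # Percentile cut points that yield equally populated (balanced) bins.
    edges = np.percentile(flat, 100.0 * np.arange(1, n_bins) / n_bins)
    bin_idx = np.searchsorted(edges, flat)      # balanced bin index per weight
    levels = np.linspace(-1.0, 1.0, n_bins)     # uniformly spaced quantized values
    return levels[bin_idx].reshape(weights.shape)

# Usage: even a heavily skewed weight tensor fills every quantization level evenly.
w = np.random.randn(1024) ** 3                  # imbalanced (heavy-tailed) distribution
q = balanced_quantize(w, bits=2)
print(np.unique(q, return_counts=True))         # each of the 4 levels holds ~25% of the weights
```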